Babylab data - exploration
Since last time:
- Replaced task numbers by names
- Harmonised terminology
- Added image for saccade to target + task descriptions
- Added total looking time variable to analyses (correlations etc)
- Added detailed within groups analysis
To discuss:
- 166 missing reaction time target!! (I haden’t noticed…) + 16 missing reaction time pursuit, which to keep? reaction time scene?
- Generally discuss the final groups
- confirm that arithmetic mean OK to combine scores within group (normal distributions, no outliers, similar variance)
1 Intro
1.1 Context
One of the goals of sepages is to analyse the link between exposure to endocrine disruptors during pregnancy and the infant’s neurological development. One of the experiments to quantify neurologic development was to perform eye tracking experiments at Grenoble’s LPNC BabyLab. Sepages infants were subject to several tasks and were seen up to 3 times: at 5 months, 12 months, 24 months.
Here are the descriptive stats on the babylab dataset DataFile_030220_anonym_id.csv.
First we present the different tasks and scores. Then we briefly describe the population. Finally we decide on a subselection of scores for final analysis.
1.2 Eye tracking
Eye movement is measured by X-Y coordinates continuously. Eye movement consists of a sequence of saccades (movement) and fixations (non movements). Using the eye tracker several tasks are performed for which various scores were computed. Four tasks were performed for this study:
- scene perception
- face perception and recognition
- saccades to target
- smooth pursuit
1.2.1 Task 1: Scene perception (#scene)
An image is shown to the baby for five seconds each. The experiment is repeated six times.
We are interested in looking at what part of the image the infant looks at.
The scores available for this task are
- The number of saccades during the task (
sacc_n_scene) - The values of a and b in the equation v = b*d^a (
p1_scene&p2_scene) - The mean number of fixations over the six images (
fix_n_scene) - The mean total time spent looking at the image (
look_t_scene) - The mean duration of one fixation (
fix_dur_scene) - A score to compute if the child looks at the same place of the image or not as the other children (
map_scene)
1.2.2 Task 2: Face perception and recognition (#face)
Two pictures of the same human face are simultaneously shown to the baby for 5 seconds. Then, one picture is replaced by a new one (the faces have different levels of attractiveness) and the two different pictures are shown to the baby for 5 seconds. The experiment is repeated four times.
We are interested in measuring the reaction to novelty: the baby should preferentially look at the new face.
The scores available for this task are
- The number of saccades during the task (
sacc_n_face) - The values of a and b in the equation v = b*d^a (
p1_face&p2_face) - The mean number of fixations over the six images (
fix_n_face) - The mean total time spent looking at the image (
look_t_face) - The mean duration of one fixation (
fix_dur_face) - The reaction to novel face over four tests (
novelty_face) need more info on formula - Reaction time (
react_t_face) need more info on formula - Percent time spent looking at eyes (
pct_eyes_face) - Percent time spent looking at mouth (
pct_mouth_face)
1.2.3 Task 3: Saccades to target (#target)
Attention of child is fixed at the center of the screen with a target. When the attention of the child is fixed on the target in the middle of the screen, the target disappears and re-appears in one of the 8 peripheral positions. The child then saccades toward the new target until it fixes it. The target then switches to the middle and the experiences is repeated 8 times.
The scores available for this task are
- Reaction time (
reac_t_target) need more info on formula - Mean length of the first saccade to target (
dist_target)
1.2.4 Task 4: Smooth pursuit (#pursuit)
There is a central fixation point. Once the point disappears a new point appears on the right and starts to move. The child then pursues the target that does three circles (not sure of the N). Then the target reappears in the middle and the experiment is repeated 4 times.
[ADD IAMGE]
The scores available for this task are
- Reaction time (
reac_t_pursuit) need more info on formula
2 Population description
2.1 Individuals
- N unique individuals = 188
- Sex:
| n | % | val% | |
|---|---|---|---|
| F | 83 | 44.1 | 44.1 |
| M | 105 | 55.9 | 55.9 |
As a reminder in sepages: 251 boys (53%) and 218 girls (47%).
- Birth dates:
2.2 Experiments
- N experiments: 264
Task 2 missing for first 5 (not yet done) and when all signal was bad. (cf. notes meeting with DM 11/03/2020). Same for task 4. These tasks are more “difficult” so it’s harder to get valid signal.
2.3 Age at experiment
Older outlier:
## # A tibble: 1 x 4
## date_birth date_exp age_cat age_days
## <date> <date> <dbl> <dbl>
## 1 2016-12-07 2019-01-08 24 1097
True age in months = 1097 / 30.4 = 36…
Error? Exclude?
Three groups of age: 5 months, 12 months and 24 months with a majority of 24 months. Categorical age var:
| n | % | val% | |
|---|---|---|---|
| 5 | 46 | 17.4 | 17.4 |
| 12 | 67 | 25.4 | 25.4 |
| 24 | 151 | 57.2 | 57.2 |
2.4 Repeated measures
Number of repeated data:
- 118 children with 1 experiment
- 64 children with 2 experiments
- 6 children with 3 experiments
2.5 N with exposure
2.5.1 eye tracker 5 months
| period | n |
|---|---|
| exposure_T1 | 46/46 (100%) |
| exposure_T3 | 46/46 (100%) |
| exposure_M2 | 20/46 (43%) |
| exposure_Y1 | 14/46 (30%) |
2.5.2 eye tracker 12 months
| period | n |
|---|---|
| exposure_T1 | 67/67 (100%) |
| exposure_T3 | 67/67 (100%) |
| exposure_M2 | 14/67 (21%) |
| exposure_Y1 | 13/67 (19%) |
2.5.3 eye tracker 24 months
| period | n |
|---|---|
| exposure_T1 | 151/151 (100%) |
| exposure_T3 | 150/151 (99%) |
| exposure_M2 | 56/151 (37%) |
| exposure_Y1 | 40/151 (26%) |
3 Eye tracker scores
First we do some univariate statistics on the scores then we look at relations between them to see if we can/need to exclude some from further analysis.
3.1 Description
Data Frame Summary
scoresDimensions: 264 x 24
Duplicates: 0
| No | Variable | Label | Stats / Values | Freqs (% of Valid) | Graph | Missing | ||||||||||||||||||||||||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | ident [character] | Unique child id | 1. 22692 2. 23250 3. 24921 4. 25043 5. 25130 6. 25509 7. 16606 8. 16958 9. 17154 10. 17465 [ 178 others ] |
|
0 (0%) | |||||||||||||||||||||||||||||||||||||||||||||
| 2 | S_4 [numeric] | Mean (sd) : 17.6 (7.7) min < med < max: 5 < 24 < 24 IQR (CV) : 12 (0.4) |
|
0 (0%) | ||||||||||||||||||||||||||||||||||||||||||||||
| 3 | S_5 [numeric] | AgeInDays | Mean (sd) : 560.5 (249.6) min < med < max: 105 < 737.5 < 1097 IQR (CV) : 383.2 (0.4) | 149 distinct values | 0 (0%) | |||||||||||||||||||||||||||||||||||||||||||||
| 4 | sacc_n_scene [numeric] | E1SaccNum (the number of saccades) | Mean (sd) : 82 (15.6) min < med < max: 38 < 81.5 < 170 IQR (CV) : 17.2 (0.2) | 62 distinct values | 0 (0%) | |||||||||||||||||||||||||||||||||||||||||||||
| 5 | p1_scene [numeric] | E1param1 (the value of a in v = b*d^a, with v = max displacement per sample in pixels and d = distance in pixels) | Mean (sd) : 0.7 (0.1) min < med < max: 0.6 < 0.7 < 0.9 IQR (CV) : 0.1 (0.1) | 262 distinct values | 0 (0%) | |||||||||||||||||||||||||||||||||||||||||||||
| 6 | p2_scene [numeric] | E1param2 (the value of b in v = b*d^a) | Mean (sd) : 0.7 (0.2) min < med < max: 0.2 < 0.7 < 1.4 IQR (CV) : 0.2 (0.3) | 264 distinct values | 0 (0%) | |||||||||||||||||||||||||||||||||||||||||||||
| 7 | fix_n_scene [numeric] | E1MeanFixNum (the mean number of fixation over 5 seconds of presentation for fixation quality == 1) | Mean (sd) : 9.1 (2.4) min < med < max: 2.5 < 9.3 < 14.5 IQR (CV) : 3.3 (0.3) | 73 distinct values | 0 (0%) | |||||||||||||||||||||||||||||||||||||||||||||
| 8 | fix_dur_scene [numeric] | E1MeanFixDur (the mean duration of one fixation for fixation quality == 1) | Mean (sd) : 370 (60.1) min < med < max: 272 < 358 < 697 IQR (CV) : 77 (0.2) | 151 distinct values | 0 (0%) | |||||||||||||||||||||||||||||||||||||||||||||
| 9 | map_scene [numeric] | E1MeanMapFix (the mean of density map values at fixations points for fixation quality == 1 | 2) | Mean (sd) : 0.1 (0) min < med < max: 0 < 0.1 < 0.1 IQR (CV) : 0 (0.2) | 264 distinct values | 0 (0%) | |||||||||||||||||||||||||||||||||||||||||||||
| 10 | look_t_all_scene [numeric] | E1MeanLTAll (the mean of LT per image for fixation quality == 1 | 2) | Mean (sd) : 4141.8 (311.5) min < med < max: 2515.3 < 4208 < 4916 IQR (CV) : 315.3 (0.1) | 238 distinct values | 0 (0%) | |||||||||||||||||||||||||||||||||||||||||||||
| 11 | sacc_n_face [numeric] | E2SaccNum | Mean (sd) : 97.8 (15.9) min < med < max: 46 < 99 < 148 IQR (CV) : 20.5 (0.2) | 69 distinct values | 9 (3.41%) | |||||||||||||||||||||||||||||||||||||||||||||
| 12 | p1_face [numeric] | E2param1 | Mean (sd) : 0.7 (0.1) min < med < max: 0.5 < 0.7 < 0.9 IQR (CV) : 0.1 (0.1) | 254 distinct values | 9 (3.41%) | |||||||||||||||||||||||||||||||||||||||||||||
| 13 | p2_face [numeric] | E2param2 | Mean (sd) : 0.8 (0.2) min < med < max: 0.3 < 0.8 < 1.6 IQR (CV) : 0.3 (0.3) | 254 distinct values | 9 (3.41%) | |||||||||||||||||||||||||||||||||||||||||||||
| 14 | fix_n_face [numeric] | E2MeanFixNum | Mean (sd) : 8.3 (2.3) min < med < max: 2 < 8.2 < 15.8 IQR (CV) : 2.9 (0.3) | 72 distinct values | 9 (3.41%) | |||||||||||||||||||||||||||||||||||||||||||||
| 15 | fix_dur_face [numeric] | E2MeanFixDur | Mean (sd) : 382.8 (66.8) min < med < max: 222 < 372 < 607 IQR (CV) : 80.5 (0.2) | 149 distinct values | 9 (3.41%) | |||||||||||||||||||||||||||||||||||||||||||||
| 16 | map_face [numeric] | E2MeanMapFix | Mean (sd) : 0.1 (0) min < med < max: 0 < 0.1 < 0.1 IQR (CV) : 0 (0.2) | 254 distinct values | 9 (3.41%) | |||||||||||||||||||||||||||||||||||||||||||||
| 17 | look_t_all_face [numeric] | E2MeanLTAll | Mean (sd) : 4008.2 (353.9) min < med < max: 2155 < 4081 < 4645.5 IQR (CV) : 334.5 (0.1) | 237 distinct values | 9 (3.41%) | |||||||||||||||||||||||||||||||||||||||||||||
| 18 | novelty_face [numeric] | E2Nov (the reaction to novel face over the four test [New-Old]./[New+Old]) | Mean (sd) : 0.1 (0.2) min < med < max: -0.4 < 0.1 < 0.6 IQR (CV) : 0.2 (1.8) | 255 distinct values | 9 (3.41%) | |||||||||||||||||||||||||||||||||||||||||||||
| 19 | reac_t_face [numeric] | E2RT (reaction time at face onset) | Mean (sd) : 420.2 (84.8) min < med < max: 248 < 414.3 < 844 IQR (CV) : 104.8 (0.2) | 237 distinct values | 9 (3.41%) | |||||||||||||||||||||||||||||||||||||||||||||
| 20 | pct_eyes_face [numeric] | E2Eyes (percent time on eyes) | Mean (sd) : 0.7 (0.2) min < med < max: 0.1 < 0.7 < 1 IQR (CV) : 0.2 (0.3) | 255 distinct values | 9 (3.41%) | |||||||||||||||||||||||||||||||||||||||||||||
| 21 | pct_mouth_face [numeric] | E2Mouth (percent time on mouth) | Mean (sd) : 0.1 (0.2) min < med < max: 0 < 0.1 < 0.8 IQR (CV) : 0.2 (1.1) | 235 distinct values | 9 (3.41%) | |||||||||||||||||||||||||||||||||||||||||||||
| 22 | reac_t_target [numeric] | E3RT | Mean (sd) : 288.2 (58.4) min < med < max: 182 < 281.5 < 484 IQR (CV) : 69.5 (0.2) | 75 distinct values | 166 (62.88%) | |||||||||||||||||||||||||||||||||||||||||||||
| 23 | dist_target [numeric] | E3Dist (mean length of the first saccade to target) | Mean (sd) : 187 (28) min < med < max: 128.3 < 185.4 < 345 IQR (CV) : 28.1 (0.1) | 260 distinct values | 4 (1.52%) | |||||||||||||||||||||||||||||||||||||||||||||
| 24 | reac_t_pursuit [numeric] | E4RT | Mean (sd) : 330 (88.3) min < med < max: 96 < 313 < 750 IQR (CV) : 89 (0.3) | 171 distinct values | 16 (6.06%) |
- 166 missing reaction time target => excluded
- 16 missing task 4, exclude?
3.2 Score vs age
- There seems to be a strong age effect hence I stratify the following descriptive analyses by group age
3.3 Correlation between scores
Here we look at how the scores are correlated. We will focus on scores that are supposed to measure the same trait accross different tasks (eg reaction time at task 2 and 3) and scores that are mathematically linked (eg number of saccades and number of fixations).
3.3.1 Absolute correlations between all scores
- all scores that measure saccade/fixations are grouped
- the saccade equation parameters param1 and param2 highly correlated
- the task 2 qualitative indicators pct_eyes, pct_mouth and map_mean correlated
- correlations increase with age
- reaction times not correlated
David (cf notes meeting 11/03/2020)
- param1 donne une estimation de la « vitesse de base » du sujet
- param2 = facteur de mise a l’échelle en fonction de l’amplitude
- pas forcément de choix évident, on va commencer avec le param1
=> drop p2_scene & p2_face
Entre n fix, n sacc et dur moyenne fix pour qual == 1 (qui sont les 3 liées) : Garder absolumùent la durée moyenne de fixation car normalement, représente l’état attentionnel de l’enfant (tps mis a traiter l’info regardée). Un comportement exploratoire va se traduire par un gd nb de fiwx, plus courtes
=> drop sacc_n_scene, sacc_n_face, fix_n_scene & fix_n_face
Pct eyes et mouth : choisir les yeux parceque c’est la cible privilégéies de l’attention visuelle des visages statiques (pas pareil si c’est un visage parlant). Yeux utilisés sur les études du trouble du spectre autistique par ex. Indicateur a prioi plus sensible de la cignition sociale de l’enfanrt. Attention car trop peu regarder ou trop regarder les yeux peut etre le signe d’un comporteùment atypique (peut etre regarder la distance a la moyenne).
Pour la taxche 2 on va enlever le mean map car on a déjà une mesure de l’endroit de la fixation par les yeux mais qui est plus indicatrice de de la typicité du comprtement.
=> drop pct_mouth_face & map_face
Remaining scores for analysis:
fix_dur_scenep1_scenemap_scenepct_eyes_facefix_dur_facep1_facereac_time_facereac_time_pursuitnovelty_facedist_targetlook_t_scenelook_t_face
3.3.2 Correlation between tasks
Correlations on same scores/different tasks:
| rowname | score | cor5 | cor12 | cor24 |
|---|---|---|---|---|
| fix_n_scene | fix_n_face | 0.62 | 0.47 | 0.51 |
| fix_dur_scene | fix_dur_face | 0.56 | 0.48 | 0.73 |
| sacc_n_scene | sacc_n_face | 0.54 | 0.52 | 0.57 |
| reac_t_face | reac_t_pursuit | 0.40 | 0.19 | 0.28 |
| look_t_all_scene | look_t_all_face | 0.34 | 0.43 | 0.38 |
- The saccades/fixation parameters rather correlated accross task
- Map mean and reaction time not so much
- The reaction times even less
- Overall increase of correlation with age
3.4 Grouping tasks
Given the previous comments, and after excluding less interpretable variables p1_scene, p1_face and dist_target we limit ourselves to the following indicators:
fix_dur_scenemap_scenemap_facepct_eyes_facefix_dur_facereac_time_facenovelty_facereac_time_targetreac_time_pursuit
3.4.1 Number of fixations
Number of fixations for scene (fix_dur_scene) and face (fix_dur_face) tasks are comparable and will be grouped.
3.4.2 Reaction times
Reaction times at target (reac_t_target) and pursuit (reac_t_pursuit) tasks are comparable (time constrained) but reaction time at face task (reac_t_face) is different as there is no time contraint. Unfortunately reac_t_target has many missing (166) so it will have to be excluded.
We have already seen that reac_t_target has too many missing to be kept.
reac_t_face task not correlated to two others. Also distributions differ. Which confirms we cannot combine it with reac_t_pursuit.
As reac_t_pursuit and reac_t_target are more highly correlated, I suggest we keep reac_t_pursuit. UNLESS WE DECIDE THERE ARE TOO MANY MISSING AT PURSUIT TASK (5M = 7/39, 12M = 3/67, 24M = 6/151)
We could also keep reac_t_face seperately, but I believe we want to try to limit the number of indicators we are using.
Or we could combine z-scores?
3.4.3 Looking location
First we decided to analyse the percentage of times looking at eyes at the face task (pct_eyes_face) on its own because it is a well know marker of autistic spectrum disorders.
Then we want to compare the mean map location at the scene (map_scene) and face (map_face) tasks:
Distributions are similar and scores are slightly correlated, we can combine them.
3.4.4 Looking time
Finally we wanted to add an indicator of attention quality, for this we decided to look at the looking time for all fixations look_t_all_scene and look_t_all_face
3.4.5 Final groups
- Group 1:
fix_dur_scene,fix_dur_face - Group 2:
reac_time_pursuit - Group 3:
pct_eyes_face - Group 4:
map_scene,map_face - Group 5:
novelty_face - Group 6:
look_t_all_scene,look_t_all_face
3.5 Conclusions
reac_time_pursuithas 166 missing and has to be excluded- Scores are linked with age
- Some variables are correlated hence the final selection of variables
- The Correlations/groupings get clearer with age
Final groups:
- Group 1 - mean fixation duration:
fix_dur_scene,fix_dur_face - Group 2 - reaction time:
reac_time_pursuit - Group 3 - eye exploration:
pct_eyes_face - Group 4 - looking location:
map_scene,map_face - Group 5 - reaction to novelty:
novelty_face - Group 6 - attention quality:
look_t_all_scene,look_t_all_face
4 Correlation with other scores
4.1 Correlation with CBCL scores
Top 5 correlations:
| babylab_task | cbcl_task | cor |
|---|---|---|
| fix_n_face | anxscore | 0.20 |
| sacc_n_face | anxscore | 0.19 |
| sacc_n_scene | anxscore | 0.19 |
| reac_t_face | slescore | 0.18 |
| fix_n_scene | somscore | 0.16 |
Bottom 5 correlations:
| babylab_task | cbcl_task | cor |
|---|---|---|
| reac_t_pursuit | othscore | -0.20 |
| novelty_face | extscore | -0.20 |
| dist_target | anxscore | -0.20 |
| novelty_face | attscore | -0.21 |
| fix_dur_face | anxscore | -0.23 |
4.2 Correlation with MCHAT/MAB scores
Top 5 correlations:
| babylab_task | mchat_task | cor |
|---|---|---|
| reac_t_face | MABwordsentprodscore_y2 | 0.21 |
| sacc_n_scene | MABwordsentunderstdscore_y1 | 0.20 |
| fix_n_scene | MABwordsentunderstdscore_y1 | 0.20 |
| reac_t_face | MABwordsentlengthscore_y2 | 0.18 |
| look_t_all_scene | MABwordsentlengthscore_y2 | 0.17 |
Bottom 5 correlations:
| babylab_task | mchat_task | cor |
|---|---|---|
| pct_mouth_face | MCHATscore_y2 | -0.15 |
| novelty_face | MABwordsentlengthscore_y2 | -0.16 |
| look_t_all_face | MABwordsentunderstdscore_y1 | -0.18 |
| sacc_n_face | MABwordsentprodscore_y2 | -0.18 |
| fix_dur_scene | MABwordsentunderstdscore_y1 | -0.22 |
5 Annex: Repeated measures
5.1 Correlations
Correlation for each score between each period (5 months, 12 months and 24 months):
| task | cor_5_12 | n_1 | cor_12_24 | n_2 | cor_5_24 | n_3 |
|---|---|---|---|---|---|---|
| fix_dur_scene | 0.64 | 9 | 0.69 | 50 | 0.08 | 23 |
| pct_mouth_face | -0.30 | 6 | 0.60 | 48 | 0.38 | 20 |
| pct_eyes_face | 0.50 | 6 | 0.56 | 48 | 0.08 | 20 |
| reac_t_pursuit | 0.77 | 7 | 0.50 | 44 | 0.30 | 18 |
| fix_dur_face | 0.63 | 6 | 0.49 | 48 | 0.60 | 20 |
| sacc_n_scene | 0.61 | 9 | 0.46 | 50 | 0.07 | 23 |
| sacc_n_face | 0.57 | 6 | 0.41 | 48 | 0.24 | 20 |
| p1_scene | 0.10 | 9 | 0.39 | 50 | 0.33 | 23 |
| fix_n_face | 0.74 | 6 | 0.37 | 48 | 0.33 | 20 |
| map_face | 0.95 | 6 | 0.35 | 48 | 0.12 | 20 |
| p2_scene | 0.08 | 9 | 0.30 | 50 | 0.18 | 23 |
| map_scene | -0.18 | 9 | 0.27 | 50 | 0.21 | 23 |
| fix_n_scene | 0.55 | 9 | 0.26 | 50 | 0.40 | 23 |
| p2_face | 0.32 | 6 | 0.25 | 48 | 0.21 | 20 |
| reac_t_face | 0.37 | 6 | 0.25 | 48 | 0.15 | 20 |
| novelty_face | -0.03 | 6 | 0.24 | 48 | 0.11 | 20 |
| look_t_all_face | -0.05 | 6 | 0.23 | 48 | -0.14 | 20 |
| p1_face | 0.07 | 6 | 0.21 | 48 | 0.17 | 20 |
| dist_target | 0.28 | 8 | 0.14 | 48 | 0.07 | 21 |
| look_t_all_scene | -0.28 | 9 | -0.06 | 50 | -0.12 | 23 |
5.2 Visualisation
5.3 Conclusions
Not sure yet what we can do with the repeated measures.